CDS

Accession Number TCMCG024C56326
gbkey CDS
Protein Id XP_022039339.1
Location complement(join(168930784..168931014,168931115..168931258,168931332..168931419,168932583..168932688,168932840..168932994,168933526..168933595,168933880..168934046,168934605..168934672,168934926..168935029,168935113..168935283,168935352..168935489,168935584..168935767,168936901..168937302))
Gene LOC110941948
GeneID 110941948
Organism Helianthus annuus

Protein

Length 675aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022183647.2
Definition serine protease SPPA, chloroplastic [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category OU
Description protease
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
KEGG_ko ko:K04773        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTCTAGGTTTCTCACCACCTCCGTCCACATCTCCGCCGCTATCCTCACCAAATCCCGCTCTCCTCTTTACCTACCTCCCTCTTCTTTCTCCCCTCCGCATTTCACTTCCACTTACCACCGTCATCTTCCTAAACCTCACCGCTCCATTTCAATCCGAGCCGTTGATTCCTCATCGGATACCAAAAGTGAGGATGTTTCGTCGGAGGATAGGACCGAGTTGAAGTCGGAATTGGATAGTAATGGCAGTTTGAGAGGCGATGGTGATTATCCTAGCGGTGAATTTGAATTTGAAACGCCAGGCGCGTGGAAAAGCTTCGTGGTGAAGCTGCGGATGCTAATTGCTTATCCCTGGCAACGTGTTCGTAAAGGCAGTGTTCTCAATTTGAAATTGCGTGGACAGATATCGGATCAGGTGAAGACCCGATTCTCTTCGGGGTTATCCCTGCCTCAAATCTGTGAAAACTTGATAAAGGCAGCATACGATCCTCGTATATCTGGTGTTTATCTTCATATCGAAACCCTGAACTGTGGGTGGGCTAAGATTGAAGAAATTAGAAGACACATATTGGATTTTAGAAAGTCAGGAAAGTTCATTATTGGTTACGCACCTCTCTGGGGTGAAAAGGAGTATTACCTTGGTTGTGCCTGTGAAGAACTGTACGCCCCTCCAAGTGCGTATTTTTCATTATACGGTTTAACTGCTCAAGCACAATTTCTTGGAGGTGTACTTGAGAAAGTAGGCGTGGAACCACAAGTGCAGAGAATCGGTAAATATAAAAGTGCTGGTGATCAGTTAACTCGCAAGAATATATCTGAAGAAAACCGTGAGGTGCTTAATACATTGCTTGATAATATCTATGGAAATTGGGTTGATAAGATTTCTCAAGCCAAAGGAAAGAAAAAGGAAGAAATCGAGAGTTTTATCAATGAAGGAGTTTACCAAATAGATAAGTTGAAAGAAGATGGATGGATTACAGATATTAAATATGATGATGAGGTTACATCTATGTTGAAAAAAAAATTGGGCATTGCAGAAGAGAAAAAACTCCCGATTGTTGATTACAAGAAATACTCAAAAGTTAAGAAATGGACTGTGGGGTTATCTGGTGGCAAGGATAAAATTGCGGTAATTAGAGCTTCCGGTAGCATCAGTCGTGTACGGGGACCATTTAGTTCACCTAGTTCAGGCATTATTGCTGAGCAATTCATTGAGAAGATTCGCAGCGTACGAGAGTCAAAAAGGTACAAGGCTGTTATCATCCGGATTGACAGCCCTGGAGGTGATGCTCTTGCTTCTGACTTGATGTGGAGGGAAATCAGACTACTGGCTGAATCTAAGCCTGTAGTTGCATCAATGGTTGATGTGGCAGCTAGTGGAGGATACTATATGGCCATGGCAGCACAAACTATACTCTCGGAGAACCTTACTTTGACTGGTTCAATTGGCGTTGTTACAGGTAAGTTCAATTTGGGGAAACTTTACGAAAGGATCGGTTTCAACAAGGAAGTTATTTCAAGGGGACGATTTGCTGAGTTGACTGCTGCTGACCAGCGGCCATTCAGACCCGATGAAGAGAAACTATTTGCGGAATCTGCTCAAAATGCTTACAAACAGTTTCGTAACAAGGCAGCATTTTCAAGATCAATGAGTGTAGATAAAATGGAGGAGTTTGCTCAAGGAAGAGTATGGAGTGGTAATGATGCCGCTTCACGGGGTTTAGTTGATGCAATTGGCGGCTTTTCACGGGCTGTCGCTATAGCCAAACACAAGGCCAACATACCTCAGGACAAACAGGTTACTTTGGTTGAGTTGTCAAGATCATCACCCTCTTTACCAGAAATCCTTAGTGGAATAGGGAGCTCGGTAATCGGGATAGACATGGCATTAAAGCAGCTAATGGATGGCTTAGCATCAAGCGACGGGGTGCAAGCCCGTATGGATGGAATCATGTTTCAAAGATCAGAAGGATCTTCATTTGCAAATCCTATTTTCAATCTGCTTAAAGACTACTTGAGTTCTCTTTGA
Protein:  
MSRFLTTSVHISAAILTKSRSPLYLPPSSFSPPHFTSTYHRHLPKPHRSISIRAVDSSSDTKSEDVSSEDRTELKSELDSNGSLRGDGDYPSGEFEFETPGAWKSFVVKLRMLIAYPWQRVRKGSVLNLKLRGQISDQVKTRFSSGLSLPQICENLIKAAYDPRISGVYLHIETLNCGWAKIEEIRRHILDFRKSGKFIIGYAPLWGEKEYYLGCACEELYAPPSAYFSLYGLTAQAQFLGGVLEKVGVEPQVQRIGKYKSAGDQLTRKNISEENREVLNTLLDNIYGNWVDKISQAKGKKKEEIESFINEGVYQIDKLKEDGWITDIKYDDEVTSMLKKKLGIAEEKKLPIVDYKKYSKVKKWTVGLSGGKDKIAVIRASGSISRVRGPFSSPSSGIIAEQFIEKIRSVRESKRYKAVIIRIDSPGGDALASDLMWREIRLLAESKPVVASMVDVAASGGYYMAMAAQTILSENLTLTGSIGVVTGKFNLGKLYERIGFNKEVISRGRFAELTAADQRPFRPDEEKLFAESAQNAYKQFRNKAAFSRSMSVDKMEEFAQGRVWSGNDAASRGLVDAIGGFSRAVAIAKHKANIPQDKQVTLVELSRSSPSLPEILSGIGSSVIGIDMALKQLMDGLASSDGVQARMDGIMFQRSEGSSFANPIFNLLKDYLSSL